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ABSTRACT Microarrays containing 1046 human cDNAs 
of unknown sequence were printed on glass with high-speed 
robotics. These 1.0-cm 2 DNA "chips" were used to quantita- 
tively monitor differential expression of the cognate human 
genes using a highly sensitive two-color hybridization assay. 
Arrav elements that displayed differential expression patterns 
under given experimental conditions were characterized by 
sequencing. The identification of known and novel heat shock 
and phorbol ester-regulated genes in human T cells demon- 
strates the sensitivity of the assay. Parallel gene analysis with 
microarrays provides a rapid and efficient method for large- 
scale human gene discovery. 

Biology has entered the genome era (1). Complete genome 
sequences for all of the model organisms and human will 
probablv be available by the year 2003 (2). Torrents of human 
expressed sequence tags (ESTs) provide a starting point for 
elucidating the function of tens of thousands of cognate genes 
(3). Genome analysis will provide insights into growth, devel- 
opment, differentiation, homeostasis, aging, and the onset of 
diseases (1-3). A detailed understanding of the human genome 
will require the implementation of sophisticated methods for 
gene expression analysis and gene discovery. 

Recently, a microarray-based method for high-throughput 
monitoring of plant gene expression was described (4). This 
'chip"-based approach involved using microarrays of cDNA 
clones as gene-specific hybridization targets to quantitatively 
measure expression of the corresponding plant genes (4, 5). A 
two-color fluorescence labeling and detection scheme facili- 
tated sensitive differential expression analysis of different 
plant tissues (4, 5). The efficiency of this approach for studies 
in higher plants suggested the use of this method for human 
genome analysis (4-7). Here, we report the use of cDNA 
microarrays for human gene expression monitoring, biological 
investigation, and gene discovery. 

MATERIALS AND METHODS 

Human cDNA Clones. The cDNA library was made with 
mRNA from human peripheral blood lymphocytes trans- 
formed with the Epstein-Barr virus. Inserts >600 bp were 
cloned into the lambda vector AYES-R to generate 10 7 -10» 
recombinants. Bacterial transformants were obtained by in- 
fecting E. coli strain JM107/AKC. Colonies were picked at 
random and propagated in a 96-weIl format, and minilysate 
DNA was prepared by alkaline lysis using REAL preps 
(Qiagen, Chatsworth, CA). Inserts were amplified by PCR in 
a 96-well format using primers (PAN132, 5-CCTC- 
TATACTTTAACGTCAAGG; and PAN133, 5'-TTGTGTG- 
GAATTGTGAGCGG) complementary to the AYES 
polylinker and containing a six-carbon amino modification 
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(Glen Research, Sterling, VA) on the 5' end. PCR products 
were purified in a 96-well format using QIAquick columns 
(Qiagen). 

Microarray Preparation. Amino-modified PCR products 
were suspended at a concentration of 0.5 mg/ml in 3x 
standard saline citrate (SSC) and arrayed from 96-well micro- 
titer plates onto silylated microscope slides (CEL Associates, 
Houston) using high-speed robotics (4-7). A total of 1056 
cDNAs. representing 1046 human clones and 10 Arabidopsis 
controls, were arrayed in 1.0-cm : areas. Printed arrays were 
incubated for 4 hr in a humid chamber to allow rehydration of 
the arrav elements and rinsed, once in 0.2% SDS for 1 min, 
twice in H : 0 for 1 min, and once for 5 min in sodium 
borohydride" solution (1.0 g of NaBH4 dissolved in 300 ml of 
PBS and 100 ml of 100% ethanoi). The arrays were submerged 
in H 2 0 for 2 min at 95°C, transferred quickly into 0.2% SDS 
for l min, rinsed twice in H 2 O t air dried, and stored in the dark 
at 25X. 

Fluorescent Probes. Tissue mRNAs were purchased 
(CLONTECH). Jurkat mRNA was isolated as described by 
Schena et ai (4). Probes were made as described (4) with 
several modifications. The reverse transcriptase used here was 
Superscript II RNase H- (GIBCO). The Cy5-dCTP was 
purchased from Amersham. Each reverse transcription reac- 
tion contained 3.0 jig of total human mRNA. Arabidopsis 
control mRNAs were made by in vitro transcription of cloned 
HAT4, HAT22, and YesAt-23 cDNAs (4. 8, 9) using an RNA 
Transcription Kit (Stratagene). For quantitation, the mRNAs 
were doped into the reverse transcription reaction at ratios of 
1:100,000, 1:10,000, and 1:1000 (wt/wt) respectively. Following 
the reverse transcription step, samples were treated with 2.5 jil 
of 1 M sodium hydroxide for 10 min at 37°C, then neutralized 
bv adding 2.5 ^1 of 1 M Tris-HCl (pH 6.8) and 2.0 *il of 1 M 
HCL Probe mixtures contained cDNA products derived from 
3 ,ig of total mRNA. suspended in 5.0 ^1 of hybridization 
buffer (5x SSC plus 0.2% SDS). 

Hybridization and Scanning. Probes were hybridized to 
1.0-cm 2 microarrays under a 14 x 14 mm glass coverslip for 
6-12 hr at 60°C in a custom-built hybridization chamber (4-7). 
Arrays were washed for 5 min at room temperature (25°C) in 
low stringency wash buffer (lx SSC/0.2% SDS), then for 10 
min at room temperature in high stringency wash buffer (0.1 X 
SSC/0.2% SDS). Arrays were scanned in 0.1 x SSC using a 
fluorescence laser scanning device (4-7), fitted with a custom 
filter set (Chroma Technology, Brattleboro, VT). Accurate 
differential expression measurements (i.e., final fluorescence 
ratios) were obtained by taking the average of the ratios of two 
independent hybridizations. 



Abbreviation: EST, expressed sequence tag. 

Data deposition: The sequences reported in this paper have been 
deposited in the GenBank data base (accession nos. U56654-U56660). 
tTo whom reprint requests should be addressed, e-mail: schena<& 
cmgm.stanford.edu. 
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Cell Culture. Jurkat cells were grown in a tissue culture 
incubator <37°C and 5% CO : ) in RPMI medium supplemented 
with 10% fetal bovine serum, 100 \ig of streptomycin per ml 
and 500 units of penicillin per ml. Heat shock corresponded to 
a 4-hr incubation at 43 C C Phorbol ester treated cells were 
erown for 4 hr in the presence of 50 ng of phorbol 12-mynstate 
13-acetate (PMA) per ml. 

RNA Blotting. Dot blots were performed as described (4). 

DNA Sequencing. Sequences were obtained using the 
PAN132 and PAN133 primers and a 373A automated se- 
quencer, according to the instructions of the manufacturer 
(Applied Biosvstems). 

Computer Graphics and Informatics. Pseudocolor represen- 
tations of fluorescent images were made with National Institutes 
of Health image software (version 1.52). Software for differential 
expression representations was purchased from Imaging Re- 
search (St. Catherine's, ON, Canada). Sequence searches were 
made to the nonredundant nucleotide data base at the National 
Center for Biotechnology Information (NCBI) using Macintosh 
blast software. The EST data base was accessed via the World 
Wide Web (http:/www.ncbi.nlm.n ih.gov/). 

RESULTS 

Gene Discovery and the Heat Shock Response. Microarrays 
were used to examine the heat shock response in cultured 
human T (Jurkat) cells. Control (37°C) and heat-treated 
(43°C) cells were harvested and lysed, and total mRNA from 
the two cell samples was labeled by reverse transcriptase 
incorporation of fluorescein- and Cy5-dCTP, respectively. In 
a second set of labeling reactions, the fluorescent groups were 
"swapped" such that samples from control and heat-treated 
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samples were labeled with Cy5- and f luorescein-dCTP. respec- 
tively. Each pair of fluorescent probes was hybridized to a 
1056-element microarray. The arrays were washed at high 
stringency and scanned with a confocal laser scanning device 
to detect emission of the two fluorescent groups. 

Hybridization signals were observed to >95% of the human 
cDNA arrav elements, but not to any of the Arabidopsis 
negative controls (Fig. 1). Fluorescence intensities spanned 
more than three orders of magnitude for the 1046 array 
elements surveyed (Fig. 1). Comparative expression analysis of 
heat shocked versus control cells in the two experiments 
revealed 17 arrav elements that displayed altered fluorescence 
ratios of >2.0-'fold (Figs. 1 and 24). Of the 17 putative 
differentially expressed genes. 11 were induced by heat shock 
treatment and 6 displayed modest repression (Figs. 1 and 2A ). 

To determine the identity of the heat-regulated genes. 
cDNAs corresponding to each of the 17 array elements were 
sequenced on the proximal and distal end. Data base searches 
revealed perfect matches for 14 of the 17 clones, and in each 
case proximal and distal cDNA sequences mapped to the same 
gene (Table l). Of the 1046 human genes examined on the 
microarraw the five most highly induced in heat-treated cells 
were heat' shock protein 90a (hsp90a), dnaJ, hsp90j3, polyu- 
biquitin, and t-complex polypeptide- 1 (tcp-1) (Table 1). Three 
of the 17 clones did not match any entry in the public data base, 
though one of the clones (B7) exhibited significant homology 
to an EST from Caenorhabditis elegans (Table 1). Each of the 
novel sequences (B7-B9) exhibited * 2- fold induction (Table 1) 
and relatively low-level expression (Table 2). 

To confirm the microarray results, mRNA levels for each of 
the genes were measured by RNA blotting. Each of the genes 
that displayed heat shock induction, including the three novel 
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Expression Ratios 
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Table 1. Microarray elements corresponding to differentially exp ressed genes 
Clone 



Row 



Column 



Ratio 



Bl 

B2 

B3 

B4 

B5 

B6 

B7* 

B8 

B9 

BIO 

Bll 

B12 

B13 

BI4 

B15 

B16 

B17 

B18 

B19 

B20 

B21 

B22 

B23 



24 
1 
15 
32 
17 
22 
5 
2 

14 

7 

12 
28 
' 14 
20 
30 
10 
13 

7 
21 

3 

1 

22 
20 



21 
31 
8 

19 
8 

31 
4 

19 
5 
8 
2 
2 
7 
9 

12 
5 

16 

19 

30 

26 

18 

30 

16 



Blast identity 



Accession no. 



0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

2.0 

2.0 

2 t 

2.4 

2.4 

2.5 

2.5 

2.6 

4.0 

5.8 

6.3 

2.0 

2.1 

2.2 

2.6 

3.5 

19 



CYC oxidase III 
0-Actin 

CYC oxidase III 
CYC oxidase III 
CYC oxidase III 
0- Act in 
Novel* 
Novel* 
Novel* 
Polyubiquitin 
TCP-1 

Polyubiquitin 

Polyubiquitin 

HSP90/3 

DnaJ homolog 

HSP90q 

HSP90q 

^-microglobulin 
Novel* 

fN-microglobulin 
PGK 
NF-kB1 
PAC-1 



J01415. J01415 
NR. X00351 * 
J01415, J01415 
J01415, J01415 
J01415, J01415 
NR, X00351 
U56653, U56654 
U56655, U56656 
U56657, U56658 
X04803. X04K03 
X528S2. X52882 
M17597, M17597 
X04803, X04803 
Ml 6660, Ml 6660 
D 13388. D 13388 
X07270. X07270 
M27024, X15183 
S54761. M30683 
U56659, U56660 
S54761, M30683 
Mil 968. L00160 
Z47744, M55643 
LI 1329, LI 1329 



^. ; . liij^v. Lii.^y 

Clones showing >9&% identity over 300 ndI^^^tlZh?£3?i V . J es, "- ,reau:d ("18-23) Jurka. cells, 
except CYC oxidase III (mitochondrial! AmrinS assunled to be denlical to known sequences. All eenes are nuclear 
respectively. CYC. ^^^l^^^^J^^^*? <°' P"»™»' distai sequence traces. 
NF-kB. nuclear factor-kappaB: PAC-1. phosphate o7 S ted ,„ NP° C , Pr °' e,n; ^ P hos P h °?'y«ra.e kinase: 
poly(A)+ tract. pnuspnatase ol actuated cells, and NR, trace not readable due to the presence of 

•B7 is 67% identical to an EST from C. elegans (D76026) 
T No match in the public data bases. 
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Table 2. Human gene expression monitored by microarray and RNA blot analyses 



10617 



Clone 



Expression level, per 10 5 mRNAs 



Blast identity 



Bl CYC oxidase III 

B2 0-Actin 

B3 CYC oxidase III 

B4 CYC oxidase III 

B5 CYC oxidase III 

B6 0-Actin 

B7 Novel (weakly to D76026) 

B8 Novel 

B9 Novel 

BIO Poiyubiquitin 

Bll TCP-1 

B12 Poiyubiquitin 

B13 Poiyubiquitin 

B14 HSP9O0 

B15 DnaJ homolog 

B16 HSP90q 

B17 HSP90q 

B18 ^-microglobulin 

B19 Novel 

B20 fc-microglobulin 

B21 Phosphoglvcerate kinase 

B22 NF-KB1 

B23 PAC-1 



Microarray 


Ratio 


RNA blot 


Ratio 


92/46 


0.5 


100/80 


0.8 


240/120 


0.5 


270/280 


1.0 


36/18 


0.5 


ND 


ND 


76/38 


0.5 


ND 


ND 


62/31 


0.5 


ND 


ND 


180/89 


0.5 


ND 


ND 


1.3/2.6 


2.0 


0.77/1.8 


2.3 


2.0/4.0 


2.0 


1.5/3.4 


2.3 


0.8/1.8 


2.2 


1.2/1.8 


1.5 


0.8/1.9 


2.4 


25/89 


3.6 


2.3/5.5 


2.4 


7.1/27 


3.8 


0.8/2.0 


2.5 


ND 


ND 


1.7/4.3 


2.5 


ND 


ND 


75/200 


2.6 


30/120 


4.0 


1.0/4.0 


4.0 


1.6/13 


8.1 


0.6/3.5 


5.8 


3.2/29 


9.1 


0.8/5.0 


6.3 


8.6/62 


7.2 


1.0/2.0 


2.0 


5.4/15 


2.8 


1.2/2.5 


2.1 


4.5/9.5 


2.5 


2.7/5.9 


2.2 


ND 


ND 


2.4/6.2 


2.6 


4.7/9.2 


2.0 


1.7/6.0 


3.5 


0.65/4.7 


7.2 


0.5/9.5 


19 


0.21/15 


71 



S D h K 7 L f expression levels per 100,000 mRNAs (wt/wt) of genes assaved with a microarrav (Fie 1) 
or RNA blot Ratios correspond to values from cells subjected to heat shock (Bl-17) or phorbol ester 
treatment (B18-23) relatrve to untreated cells. Clone and gene names are given in Table 1 ND not 
determined. 



sequences, exhibited elevated mRNA levels by dot blot analysis 
(Table 2). In all cases, expression ratios as determined by the 
two procedures differed by <2-fold for the genes identified in 
the heat shock experiments (Table 2). The two assays differed 
more widely in terms of assessing absolute expression levels; 
nonetheless, absolute expression as monitored on a microarray 
typically correlated with RNA blots to within a factor of five 
(Table 2). 

Phorbol Ester Signaling. To explore a signaling pathway 
distinct from the heat shock response, microarrays were used 
to examine the cellular effects of phorbol ester treatment. 
Jurkat cells were treated with phorbol ester, harvested, lysed, 
and used as a source of mRNA. Samples of mRNA from 
untreated or phorbol ester-stimulated cells were labeled with 
reverse transcriptase. The probes were mixed, hybridized to 
microarrays, and scanned for fluorescence emission of the two 
fluorescent groups. A total of six array elements displayed 
>2.0-fold elevated signals with probes from phorbol ester- 
treated cells relative to control samples (Fig. 2B). 

To determine the identity of the phorbol ester-induced 
genes, clones corresponding to the six array elements were 
sequenced. Data base searches revealed perfect matches for 
five of the six sequences (Table 1). The two most highly 
induced genes were the PAC-1 tyrosine phosphatase and 
nuclear factor-kappa Bl (NF-kBI); modest activation was 
observed for phosphoglvcerate kinase and ^-microglobulin 
(Table 1). One remaining clone (B19) did not match any entry 
in the public data base (Table 1). B19 displayed a 2.1-fold 
induction and, similar to the novel heat shock genes, a rela- 
tively low absolute expression level (Tables 1 and 2). All six of 
the phorbol ester-inducible genes displayed increased steady- 
state mRNA levels by RNA blotting (Table 2). PAC J expres- 
sion (Fig. 1; Table 2) defined a detection limit of ^1:500,000 
for the assay. 

Transcript Imaging in Human Tissues. To determine 
whether microarrays could be used to monitor expression in 
human tissues, probes were prepared from human bone mar- 



row, brain, prostate, and heart by labeling each mRNA sample 
with Cy5-dCTP. In a separate reaction, a control probe was 
prepared by labeling Jurkat mRNA with fluorescein-dCTP. 
The four Cy5-labeled probes were each mixed with an aliquot 
of the fluorescein-labeled control sample, and the four mix- 
tures were hybridized to separate microarrays. The arrays were 
washed and scanned for fluorescence emission, and hybrid- 
ization signals for each of the tissues samples were normalized 
to the Jurkat control to generate an expression profile for each 
of the 1046 clones present on the array. 

Detectable expression was observed for all 15 of the heat 
shock and phorbol ester-regulated genes in the four tissue 
types examined (Fig. 3). In general, the expression level of each 
gene in Jurkat cells correlated rather closely with expression in 
the four tissues (Table 2; Fig. 3). Genes encoding 0-actin and 
cytochrome c oxidase, the two most highly expressed of the 15 
genes in Jurkat cells (Table 2), were highly expressed in bone 
marrow, brain, prostate, and heart (Fig. 3/1). Expression of 
cytochrome c oxidase, hsp90or, and the novel B7 sequence was 
significantly greater in heart than in the other tissues (Fig. 3). 

DISCUSSION 

Many of the heat shock genes identified in this study encode 
factors that function either as molecular "chaperones" 
(HSP90a, HSP9O0, DnaJ, TCP-1) or as mediators of protein 
degradation (poiyubiquitin). The identification of these se- 
quences is consistent with the biochemical basis of heat shock 
induction (10-15). Proteins undergo denaturation at elevated 
temperatures, and those that fail to maintain proper confor- 
mation must be selectively degraded (10-15). It will be inter- 
esting to determine whether the three novel heat shock- 
inducible sequences (B7-B9) mediate protein folding and 
turnover or possess some other biochemical activity. Complete 
nucleotide sequence determination, conceptual translation, 
expression monitoring, and biochemical analysis should pro- 
vide a detailed functional understanding of these genes. 
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Fig. 3. Transcript profiles of heat shock and phorbol ester- 
regulated genes. Gene expression levels per 100,000 mRNAs (r-axes) 
are shown for 15 genes (Table 1) in human bone marrow (red), brain 
(green), prostate (blue), and heart (yellow). Genes are erouped 
according to expression levels (A-C). 



Phorbol ester, a potent activator of protein kinase C( 16 17) 
induced a set of genes distinct from those involved in the heat 
shock pathway. The most highly induced gene identified in this 
study. PAC-1. encodes a nuclear tvrosine^kinase that mav plav 
a role m regulating transcription and cell axle progression 
(18). NF-kBI. a second phorbol ester-inducibie eene is an 
intensively studied member of the Rel transcription factor 
family (19-21). The Rel proteins are activated bv a large 
number of stimuli, including phorbol esters, cytokines, bacte- 
rial and virai pathogens, and ultraviolet lieht (19-21). Modest 
activation was observed for three sequences not known to be 
inducible by phorbol esters, including phosphoelvcerate ki- 
nase, ^-microglobulin, and a novel human gene"(B19). Ex- 
tensive expression monitoring with microarravs should assist in 
understanding how each of these genes inteerate into the 
highly complex phorbol ester signaling pathway. 

It is striking that four novel human" genes were discovered 
with an array of 1000 randomly chosen clones, particularly 
because the heat shock and phorbol ester signaling pathway's 
have been so intensively studied (10-21). The facile discovery 
of these sequences underscores the fact that microarravs can 
be used for gene discovery in the absence of anv sequence 
information. By this approach, clones are chosen'at random 
from any library of interest and only those clones that display 
interesting expression patterns are sequenced and character- 
ized. This parallel assay, coupled with a modest DNA sequenc- 
ing facility, allows high-throughput human genome expression 
analysis and gene discovery. 

Genes that are activated or repressed bv a given stimulus 
provide functional clues to the cellular 'pathway involved 
(22-24). Detailed examination of these gene expression "sig- 
natures" can provide a dynamic view of the mode of action of 
a given signaling substance (22-24). Microarravs may thus 
allow rapid mechanistic examination of hormones, drugs 
elicitors, and other small molecules; moreover, functional 
analysis of transcription factors, kinases, growth factors, cyto- 
kines, receptors, and other gene products should be possible 
Efforts are underway to develop mRNA amplification strate- 
gies to enable probe preparation from minute tissue samples 
This capability might allow for high-throughput patient screen- 
ing in a clinical setting. 

The current detection limit of the assay allows monitoring of 
transcripts that represent ^1:500.000 (wt/wt) of the total 
mRNA. This 10-fold increase in sensitivity compared with the 
original report (4) was achieved largely by modifying the 
coupling chemistry, which reduced background fluorescence. 
The significance of this improvement is considerable in that 
approximately half the human genes identified in this study 
including all four novel sequences, exhibited expression levels 
below the original detection limit of 1:50,000 (4). 

The ability to detect 2-fold changes in expression was 
achieved by the use of two-color fluorescence in the labeling 
and detection schemes, digitized data collection, and custom 
software. The importance of this capability is underscored by 
the fact that nearly all of the genes examined here exhibited 
<6-fold changes in expression. The four novel genes, which 
showed <2.2-fold activation, were probably overlooked in 
previous screens that used conventional differential expression 
techniques. It may be possible to further improve the precision 
of the microarray assay by the use of closely related fluorescent 
analogs, such as Cy3 and Cy5, in the labeling and hybridization 
reactions. 

Microarravs offer a number of advantages over other po- 
tential high-capacity approaches to expression analysis. The 
chip-based approach enables small hybridization volumes high 
array densities, and the use of fluorescence labeling and 
detection schemes. These features provide a set of perfor- 
mance specifications that are unattainable with filter-based 
approaches (25, 26). The use of cDNA clones provides hy- 
bridization specificity that is not readily attained with oligo- 
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nucleotide arrays (27-30). The parallel format of the assav 
provides a simultaneous differential expression readout for 
>1000 genes. This contrasts with sequencing-based methods 
which require serial data collection for expression analysis (3l! 
32). A commercial source of cDNA microarravs would greatly 
speed the use of a chip-based approach to expression analysis 
The availability of large numbers of ESTs (3) provides a'rich 
resource of human cDNA clones for microarraying. The 
>400,000 ESTs in the public data bases represent a significant 
subset of all human genes (3, 33). Microarravs of thousands of 
ESTs will provide a powerful analytical tool' for future human 
gene expression studies. The -100,000 genes in the human 
genome (2, 33) emphasize the need for microarravs of greater 
density. Attempts to improve microdeposition techniques are 
underway and should allow construction of arravs containing 
a complete set of human gene targets (http://cmgm.stanford 
edu/-schena/). Microarravs of ^100,000 cDNA elements 
would allow expression monitoring of the entire human ge- 
nome in a single hybridization. This capacity, coupled with 
detailed biochemical analysis of the individual gene products 
would greatly speed the functional analvsis of the human 
genome. 

We thank S. Elledge (selledge@bcm.tmc.edu) for the human cDNA 
library, Qiagen representatives for help with plasmid purification, and 
A. J. Smith and colleagues at the Protein and Nucleic Acid (PAN) 
facility (Stanford) for oligonucleotide svnthesis and DNA sequencing 
We also thank members of the Davis, Brown, and Smith laboratories 
for critical comments and helpful discussions and Svnteni employees 
for technical assistance. Support for R.W.D. was 'provided by' the 
National Science Foundation (MCB9106011) and National Institutes 
of Health (R37HG00198) and for P.O.B. by the National Institutes of 
Health (3R21HG00450) and Howard Hughes Medical Institute 
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Institute. 
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